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The central challenge of implementing scalable quantum 
computing is to protect quantum systems against noise and 
decoherence while retaining the capacity to perform compu- 
tation. Quantum error correction and fault tolerant techniques 
provide a solution to this problem, and a variety of construc- 
tions for fault tolerant quantum computation have been pro- 
posed 0]-@). In all these schemes, a delicate balance must 
be maintained between coherently manipulating the encoded 
system while preserving the protected subspace and prohibit- 
ing the proliferation of errors. For example, for schemes built 
on stabilizer codes |5 1 transversal gates have the desired prop- 
erties, while in topological systems, topologically protected 
braiding operations J2] provide the logical gates. While much 
work in quantum computation has focus sed upon qubits (two- 
level systems), it is known that for any prime d, effective codes 
exist for storing eHevel quantum systems (5]-[7l. Thus qudit 
systems are also candidates for scalable fault tolerant quan- 
tum computation. 

In many approaches, the protected unitary gates are a sub- 
set of the so-called Clifford group. The stabilizer operations 
(comprising Clifford unitaries as well as preparation and mea- 
surements in the computational basis) are known to be effi- 
ciently classically simulatable |5]|6][8], an d on their own are 
not universal for quantum computation. Furthermore, several 
theorems have shown Il9l fl2l that, in general, there is a ten- 
sion between providing protection against generic noise and 
achieving universal quantum computing. 

Despite these obstacles, fault tolerant universal quantum 
computing is possible (TJ. One particularly successful ap- 
proach, known as state-injection, is to achieve universality 
by augmenting the fault tolerant operations with a supply of 
many copies of a suitable ancillary resource state. While 
methods for direct preparation of sufficiently noise-free pro- 
tected resource states have been proposed Q), a particularly 
elegant solution can be provided by distillation techniques, 
where many noisy copies of a resource state can be distilled 
to arbitrary fidelity by using only error-protected operations, 
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FIG. 1: An outline of a round magic state distillation protocol. 
Within many architectures of fault tolerance quantum computing a 
large proportional of the device is committed to these magic state 
factories. Each attempt uses n copies of a state p, and when suc- 
cessful outputs a state p' oc Given n successful attempts, 
these are used as inputs into the next iterate. Within the magic states 
model the completely positive map, £ , is composed of a sequence of 
Clifford unitaries and Pauli measurements. This figure illustrates a 
protocol where n = 4, for example the ququint, d = 5, protocol that 
we discuss throughout the article. 



while preserving the error threshold of the model. 

Here we consider the typical but idealized case, where 
the available protected operations are perfect stabilizer op- 
erations. As such whenever we speak of a qubit we mean 
a qubit encoded into either a topological system or stabilizer 
code that provides protection of stabilizer operations. This has 
become known as the magic states model and was first stud- 
ied by Bravyi and Kitaev [13] who proposed two protocols 
for distillation of qubit magic states, the resource for state- 
injection of a non-Clifford unitary. In parallel, Knill proposed 
the concept of a post-selected quantum computer |4] Q4] \T5l 
that used state preparation protocols that appeared distinct to 
magic state distillation, but were later shown to be equiva- 
lent |fl6ll . These techniques are a key component of many 
fault tolerance schemes, including for example the topologi- 
cal cluster state scheme ll3l [T7H22ll . 

Additional protocols were later discovered by Re- 
ichardt lfl6l l23l l24l . which increased the family of qubit 
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states known to be distillable. Conversely, Campbell and 
Browne ll25l l26l showed that no finite iterative protocol could 
distill all mixed nonstabilizer states. Many other results have 
contributed to our understanding of the magic states theory for 
qubit systems 1 27-30] and a 5-qubit distillation protocol has 
been implemented in an NMR system OTI . 

The theory of higher dimensional quantum computa- 
tion Il32l435l . stabilizer operations and error correcting 
codes [36 37 1 is well known. However, higher dimensional 
magic state models have been largely neglected until recently. 
In anyonic systems the dimensionality of the available stabi- 
lizer operations is determined by the underlying physics, and 
so with some physical systems we would have no choice but 
to work in the higher dimensional model (see e.g. Il38l I3910 . 
Recent progress on this problem has centered on exploiting 
a discrete phase space, or Wigner function, representation of 
quantum states R0H421 . Notably, Veitch et al. 021 showed 
that states with positive Wigner functions can never be used 
as a resource for magic state distillation. Although all sta- 
bilizer states have positive Wigner functions, there also exist 
undistillable nonstabilizer states and so bound magic states. 

These developments on no-go results took place without 
any known distillation protocols in higher dimensions. How- 
ever, recently we have proposed a protocol for 3 -dimensional, 
qutrit, systems that uses a generalization of the 5-qubit 
code (43). Magic state distillation was observed there, but the 
error suppression was slower than in qubit protocols. Here we 
present a family of protocols that distil magic states in any odd 
prime dimension and do so with a quadratic reduction in noise 
per iteration. As such the protocols are competitive with (and 
in some cases outperform) the best previously-known qubit 
protocols. 

Our protocols exploit higher dimensional quantum Reed- 
Muller codes fl4ll and so generalize the qubit protocol of 
Bravyi and Kitaev lfl3l that used a 15 -qubit quantum Reed- 
Muller code. This 15-qubit code was, to our knowledge, first 
developed by Knill, Laflamme and Zurek ll45ll and later de- 
veloped by Steane J46j. These quantum codes are constructed 
from classical Reed-Muller codes fl47M52ll . which have played 
a pivotal role in classical coding theory. Notably, the family 
of Reed-Muller codes includes the infamous Reed-Solomon 
code used for communication with the Voyager space probe 
and data storage on compact disks. 

We begin with a formal description of the Clifford group 
and the magic states model. This allows us to state our main 
theorem; roughly that magic state distillation is possible in 
higher dimensions. Next we review some basic theory of 
quantum error correction and show what properties of error 
correcting codes would enable us to build a protocol for magic 
state distillation. This sets the stage for constructing codes, the 
quantum Reed-Muller codes, which have the required proper- 
ties. Next we introduce some additional tools from classical 
coding theory that helps to simplify our analysis for a uniform 
depolarizing noise model. 

We consider several measures of performance for many 
protocols applied to systems of up-to 19 dimensional sys- 
tems. The measures indicate that two protocols for qutrits 
(3 -dimensional systems) and ququint (5 dimensional systems) 



perform well compared to both qubit protocols and protocol 
for even higher dimensional systems. For these two protocols 
we investigate their performance in more detail. We find that 
the qutrit protocol performs well, but that the ququint outper- 
forms all other known magic state protocols, both, in terms 
of the degree of error on the initial state it can tolerate and 
the efficiency of the protocol. We will see that the effective- 
ness of these protocols can be related to properties of the Clif- 
ford group. The startlingly good performance of these proto- 
cols make higher dimensional systems a enticing alternative 
to qubit systems. 

Finally, we show how to perform state-injection to convert 
the distilled magic states into non-Clifford gates. The addition 
of any non-Clifford unitary to the set of n-qudit Clifford gates 
gives a set of gates dense in SU(gP) and so approximately 
universal via the Solovay-Kitaev theorem. This fact is well- 
known for qubits, but is also true for general prime d, which 
follows from theorems proven by Nebe, Sloane and Rains in 
their recent book |53 1 (see Appendix |D)>. 

I. STABILIZER OPERATIONS AND THE MAGIC STATES 
MODEL 

We are interested in d-dimensional quantum systems, or qu- 
dits, where d is an odd prime. The computational basis states 
are labeled by j € F^, where denotes the finite field of d 
elements. For such systems, the so-called Pauli group, Vd, is 
generated by 

z = xyii><?'i. 

where © is addition modulo d, and lu = exp(i2ir/d). The 
conjugation relation, XZ = lu~ 1 ZX, is easy to verify and 
used throughout. The Pauli group over n qudits, V^, is the n- 
fold tensor product of the single qudit Pauli group. Consider 
an Abelian subgroup of the Pauli group, S, which contains 
the identity but no other multiple of the identity, e.g. cot ^ S. 
Associated with this group is a physical subspace, called a sta- 
bilizer code, and a projector onto this subspace, n cx J2ses s - 
We equate n with the code and call the group S the stabilizer 
of the code. When the code is 1 -dimensional, the projector 
describes a pure quantum state, which we call a pure stabi- 
lizer state. We will also follow common terminology and call 
any probabilistic ensemble of pure stabilizer states a stabilizer 
state, even when there does not exist a unique stabilizer group 
describing the mixture. 

The Clifford unitaries C d l are those that conjugate the Pauli 
group to itself, so Q = {U; UV 7 d l U^ = V d \ U G U(d m )}. 
The whole Pauli group is a subgroup of the Clifford group, 
V d C ■ Gottesman [ 36 1 introduced several other Clifford 
gates, including the single qudit gates 

p = E^' (j - 1)/2 li>01, m 

H = (E^ k ^ k \j)(k\)/Vd, ( ' 
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and the 2-qudit gate, SU M (or generalized CNOT) gate, 

SUM = J2\j)ti\® (3) 

3 

These gates have been shown to generate the whole Clifford 
group |54|. The magic states model also allows the imple- 
mentation of so-called Pauli measurements. Given any Pauli 

U € V2, which we express as U = X)fc=o cjfc ^fc' we a U° w 
for POVM measurements with elements {Uk}- It is common- 
place, though a modest abuse of terminology, to speak of mea- 
suring the Pauli U. 

For an n-qudit system the space of possible density matri- 
ces is within the set of bounded operators, B(H d ), acting on 
H d . For such a space the set of physical stabilizer operations 
allowed in the magic states model is captured by the follow- 
ing. 

Definition 1 Consider a completely positive map £ : 
B{T~L d m ) — > B(H d out ). The map is a stabilizer operation if 
and only if it can be composed from the following: 

1. Clifford unitaries; 

2. measurements and subsequent projections on stabilizer 
subspaces; 

3. preparation of fresh ancilla in a stabilizer state; 

4. tracing out of unwanted audits; 

5. adaptive decision making based on both measurement 
outcomes and random coin tosses. 

The number of qudits output and input may differ, as is typi- 
cally the case when magic state distillation is performed. 

II. M-TYPE GATES AND M-DISTILLATION CODES 

Every code defines an iterative scheme for magic state dis- 
tillation. However, some codes are much more suitable than 
others, and their usefulness can often be inferred from abstract 
properties of the code. In particular, the 15-qubit Reed-Muller 
code exploited by Brayvi and Kitaev has a very special prop- 
erty. There exists a product operator, of the form JJ® n , that 
acts on the logical basis as a non-Clifford operator. Such a 
code is said to have transversal non-Clifford gates, and we 
will consider generalizations of the qubit Reed-Muller codes 
with this remarkable property. 

The transversal non-Clifford gate of the 15 qubit code, 
the so-called tt/8 gate denoted U v /$, has another additional 
interesting property; For all Pauli P G V^, we have that 
Ujr/sPU^g G . Gottesman and Chuang defined this set 
of gates as the second level of an infinite hierarchy of qubit 
gates |55|. The hierarchy generalizes easily to quqits. 

Definition 2 The k th level of the Clifford hierarchy for n 
quqits is the set 

Q l (fc) = {u\yp G V2, upu^ g c n d {k - 1)}, (4) 

where the bottom level is the Pauli group C^(0) = V^. 



The hierarchy is defined recursively with the k level as the 
set of unitaries that conjugate the Pauli operators to a unitary 
in the (k — l) th level. The bottom level is fixed as the Pauli 
group and the first level is simply the Clifford group C^(l) = 
C™{. Whereas higher levels are not groups. The quqit gates of 
interest share these properties with the qubit U n /g gate and are 
defined as follows. 

Definition 3 The set of gates M.™ contains all M such that: 

1. M is diagonal in the computational basis; 

2. M dm = 1; 

3. M G SU(d); 

4. M G C^{2)/C\{1); 

We outline the motivation for these criteria and remark that 
M can be remembered as short for magic. Conditions 1-3 
will be directly related to the transversality of the gate for our 
quantum Reed-Muller codes. Furthermore, if we express the 
eigenvalues of M as exp(i2Aj7r/d m ) then condition 2 entails 
Xj are integers and condition 3 is satisfied when 2~2j Aj = 0. 
Condition 4 requires that while M is a member of the sec- 
ond level of the Clifford hierarchy, it is not a member of the 
Clifford group itself. From this we conclude that the operator 

C M = MXAP (5) 

is in the Clifford group but is not a Pauli operator. The eigen- 
states of Cm will be the attractor of our distillation protocols, 
which is why it is essential that Cm is a non-Pauli operator. 
Distillation would be possible without requiring that Cm is 
a Clifford operator, but demanding this property provides us 
with tools that improve the protocols efficiency. We observe 
that these sets form their own hierarchy, such that for any 
to < to' we have A4™ C M.™ . This holds because, al- 
most trivially, M dm+1 = (M dm ) d = l d = 1. We remark also 
that if M G M™ then Mt g M™ and we use this feature 
throughout. 

For every such set that is non-empty we will design proto- 
cols that distill eigenstates of Cm- However, we need to know 
that such gates exist. In the qubit setting, the 7r/8-phase gate 
provides such a unitary for to = 4. However, for to < 4 it 
is easy to check that all qubit gates with the form required by 
conditions (1-3) of the above definition are Clifford unitaries 
and so fail condition (4). Remarkably, for all odd prime di- 
mensions d > 3 we can find such gates for in = 2, and when 
d > 5 these gates exist for to = 1 (56). Using tall brackets to 
denote binomial coefficients we have the following. 

Theorem 1 For all odd primes d, there exists a gate M such 
that 

1. for d = 3we have M G M n d l for all to > 2; 

2. for prime d > 5 we have M G M.™ for all m > 1. 
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One such gate is the following 

M = £exp(i2A i 7r/d ro )|j)(j|, (6) 

3 

with 




We refer to this M as the canonical Aid gate. 

In particular, the canonical Aid gate is associated with the 
non-Pauli Clifford unitary 

C M = MXAp cx XP, (8) 

where P is the Clifford gate introduced earlier in Eq. d2l. 
Clearly, a different M exists for every dimension d. For no- 
tational clarity we suppress this d dependence. The proof is 
elementary, but for completeness given in App. [A] That odd 
prime dimensions can produce the desired gates with smaller 
m is no mere technicality, it has far reaching benefits for the 
magic state distillation in higher dimensions. We also remark 
that these gates, for d = 3,5, are Clifford equivalent to those 
found in Ref. ATI to be the most robust to depolarizing noise 
before becoming stabilizer operations. This family of gates 
was discovered independently and concurrently to this paper 
by Howard and Vala [56], and many other important proper- 
ties of such gates can be found there. 

The eigenstates of Cm are non-stabilizer states, which we 
label \M k ). We note that \M k ) = M\+ k ), where |+ fc ) is an 
eigenstate of X with eigenvalue w fe . We aim to use magic 
state distillation to purify copies of \Mq) from noisy copies, 
and in turn to use these for fault-tolerant state-injection of the 
magic unitary M. This brings us to our main result. 

Theorem 2 Consider any M £ Ai™ for any odd prime d 
and any integer m > 2, or any odd prime d > 5 and m > 1. 
There exists a stabilizer operation, £, that iteratively distils 
the magic state \Mq). The map £ takes n = d m — 1 copies of 
a audit state p, where 

e = 1 - (M \p\M ). (9) 

With non-zero probability the protocol outputs a state p' oc 
£(p® n ) such that 

e' = l-(A/ V|A4). (10) 

There exists a K > such that for all e we have e' < Ke 2 . 
Consequently, there exists a threshold e* > such that ifO < 
e < e* then e' < e. 

Notice that after a single iterate, using as input noisy \Mq) 
states, the protocol will output a noisy |Mq) state. By per- 
forming an even number of iterations a fixed state can be dis- 
tilled. We call this phenomena cycling, and in many cases it 
may be prevented by some Clifford unitary correction. How- 
ever, cycling can be desirable as it provides us with a mech- 
anism for producing both \Mq) and |Mq) states. The rate of 



error suppression is always quadratic, and so these results give 
the first better than linear error reductions in higher dimen- 
sional systems. 

The Clifford unitary Cm plays a practical role in several 
steps of our protocols. First, it is used for Cm —twirling, 
which is a process for converting input states into a canoni- 
cal form. By randomly choosing an integer, k — 1, d and 
applying C M we twirl any quantum state into the \M k ) ba- 
sis. Hence, all qudit states, p, can be twirled into a form that 
depends only on d — 1 independent parameters, such that 

\ E C MP(C k M y = fk\M k )(M k \. (11) 

ke¥ d k 

Our distillation protocols seek to increase the value of /q. We 
will see that Cm is also used in our protocols for Clifford cor- 
rection, which significantly increases the success probability, 
and as part of the final state-injection. 

III. MAGIC STATE DISTILLATION PROTOCOLS 
A. CSS codes 

Calderbank, Shor and Steane identified a special class of 
quantum codes, which in their honor are now known as CSS 
codes [57|. These codes have stabilizers generated by two 
subgroups, Sz and Sx, which contain only Z k and X k terms, 
respectively. Therefore, the code projector has the form = 
IT,s x n,s z . All CSS codes, can also be described by a pair of 
classical vector spaces, which correspond to Sz and Sx- If 
we have a vector u £ and a single qudit operator, U, then 
we define the n-qudit operator 

U[u] =®l =1 U Uk . (12) 

The k th element of the vector, u, tells us what multiple of U 
acts on the A: th qudit. It follows that for every s £ Sz we can 
find a u such that s = Z[u]. In fact, Sz = {Z[u}; u £ Lz] 
where Cz is a linear vector space. The closure of the stabi- 
lizer group under multiplication is easily seen to directly cor- 
respond to closure of Lz under additional modulo d. Simi- 
larly we can find a linear code, Cx, for Sx- The whole sta- 
bilizer must be Abelian and so for all u £ Cx and v £ Cz 
we require (u, v) = (BjUjVj = 0. Furthermore, for any code, 
C, we define the dual code C^ = {u; (u, v) = 0,Vv e C}. 
In terms of duality, commutation inside the stabilizer equates 
to Cx C C^ and Cz C C x . The dimensionality of the du- 
als are related by Dim(£- L ) = n — Dim(£), where n is the 
dimension of the vector field they inhabit, namely FJJ . For a 
CSS code k = n — Dmi(Cz) — Dim(£x) gives the number 
of logical qudits supported by the code n. 

Here we are solely interested in stabilizer codes of only 
d dimensions, that is a single logical qudit. It is useful to 
specify a basis spanning the code, which we again do using 
Pauli operators Zl and Xl- These are the so-called logi- 
cal operators of the subspace and they must commute with 
the code stabilizer. Whereas, with respect to each other the 
logical operators must conjugate in the same way as Z and 
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X, such that X L Z L = u>~ 1 Z L X L . It follows that there ex- 
ists an orthonormal basis, of stabilizer states that obey 
ZlMl) = ^Ul), X L \j L ) = \ j L 8 1), and which we call the 
logical basis. In this basis, the code projector can be expressed 
as II = J^j \3l) (Jl\- We also make use of the X -basis that we 
denote \ for single qudits stabilized by lo~ J 'X and | + ) for 
logical encoded states stabilized by uj~iXi,- Typically, such 
logical operators can also be expressed in terms of vectors, 
such as Xl — X[u] where commutation of Xi with Sz en- 
tails u C C^ and Cz C u^. 

Given this vector description, a useful fact is that Cz = 
(span(£x, u )) ± where the span(..., ...) is the vector space 
generated by its arguments. Let us prove this by first ob- 
serving that since Cz C u- 1 and Cz C C x we have that 
Cz C (span(£x, u)) ■ That Cz can be no smaller than this 
set follows from dimension counting; that is 

Dim[(span(£x, u)) ] = n — Dim[span(£x, u)], 
= n — Dim(£x) — 1. 

Since we have a single logical qudit, k = 1, we know also that 
Dim(Cz) — n — Dim(£^) — 1. Since the dimensionalities 
match, the assertion is proven. Taking also Zl = Z\y\ and 
noting (C- 1 ) 1 - = C, many such results for single qudit codes 
can be deduced by similar reasoning, 

C z = [span(£ x ,u)]- L , (13) 

Cz = span(£x,u), (14) 

C x = [span(£ z , v)]- 1 , (15) 

C x = span(£ z ,v). (16) 

We employ the above relations throughout. 

The smallest unitary capable of non-trivially acting on the 
code gives the robustness of the code to noise. For CCS 
codes it suffices to consider phase and bit flip noise sepa- 
rately. For an operator U[u] its "size" is measured by the 
Hamming weight, \u\h = {4h x j\ x j 7^ 0}, so the number 
qudits upon which the operator acts non-trivially. The ro- 
bustness to phase noise is measured by the distance, Dz = 
min{|v|^f ; Z[v]II = ZlII}, and for bit flip noise D x = 
min{|v|# ; X[v]II = X]JX\. The overall distance of the code 
is D = min{Dx, Dz}- Finally we remark that for any code 
there always exists a Clifford unitary that decodes, such that 
UZ L W = Zx and UX L W = X x . 



B. Suitable codes 

We now define the broad class of quantum codes that we 
show can be used to distill these magic states. 

Definition 4 An n-qudit stabilizer code, IT, is an M.™- 
distillation code if all of the following hold 

1. all M G M.™ are transversal such that 
M® n n(M®")t = MjTLMl ; 

2. it has distance, D > 2; 



3. it has logical Pauli operators Xl = X[l] and Zl = 

z[(d-m 

We have introduced the vector shorthand 1 = (1,1, ...1). No- 
tice that we require a special kind of transversality, such that 
the logical operator, Mjf, is implemented by applying M® n . 
The need for complex transposition will be explained later, 
and will be seen to result in a cycling phenomenon in the dis- 
tillation protocol. 

Here we show that all TVf ^—distillation codes can be used 
to perform distillation for magic states of the form \Mq) = 
M\+o) for all M £ -M™. Due to cycling, after a single iterate 
using as input noisy \Mq) states, the protocol will output a 
noisy \Mq) state. 

Theorem 3 Given an n-qudit A4™ -distillation code of dis- 
tance D the following holds. For all M £ there exists a 
stabilizer operation, £, that iteratively distils the magic state 
\Mq). The protocol takes as input n copies of a state, p, where 

e = l-(M \p\M ). (17) 

With non-zero probability the protocol outputs a state p 1 cx 
£(p® n ) such that 

e' = \-{Ml\p'\Ml). (18) 

There exists a K > such that for all e we have e' < Ke D . 
Consequently, there exists a threshold e* > such that if0< 
e < e* then e' < e. 

Later we show the existence of the required codes with d = 
2, which will then entail Thm. [2] For now we show how to 
proceed given such a code. 

C. The protocol 

We prove the above key result constructively. Given an n- 
qudit A^™-distillation code and any M £ M™, we can per- 
form the following iterative magic state distillation protocol. 

1. Take n copies of the state p and Cm -twirl; 

2. Measure generators of the phase stabilizer Sz', 

3. Accept all outcomes, but perform a Clifford correction 
operator Cm[w] tuned to outcomes; 

4. Measure generators of the bit-flip stabilizer Sx', 

5. Postselect on all "+1" measurement outcomes; 

6. Decode the encoded qudit to a single qudit; 

7. Use the output labelled p' as input in the next iterate. 

When iterating the protocol, on the odd iterates we must re- 
place Cm by C M to account for cycling. We have not yet 
defined the exact setting of Cm[w], but will come to this in 
due time. For simplicity though, we begin with assuming that 
step 2 generates all "+1" measurement outcomes, for which 
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Cm [w] = 1. We explain later how the Clifford correction in 
step 3 increases the success probability. 

After CM-twirling the n-copies we have a state 



P 



0n 



= J2 a v \M v ){M v \ 



vGF" 



where 



and 



|M V > = \M Vl )\M V2 )...\M Vn ), 



k<£¥ d 



(19) 



(20) 



(21) 



where wt^(v) is the fc-weight, the number of elements in v 
equal to k, and fk — (Mk\p\Mk) ■ We note that, 



P® n = M i ( E «v|+v><+v| ] M Ll (22) 

where M^ L = M® n . Upon a successful projection onto the 
code subspace, we have 




a v n|+ v )(+ v |n m l , 



(23) 



as the projector commutes with Ml- We need to determine 
the effect of each term n|+ v ), which we will find to be 



n|+v> 
n|+v> 



0;Vv^£i; (24) 
VH|+^);Vv® jl = w,s.t.w e Cz- (25) 



The first equation covers all v ^ C x and the second equa- 
tion covers all v g span(£z, !)• By virtue of Eq. (16i we 
know C x = span(£z, 1) and so these equations account for 
all possible v. The constant c gives the probability of this 
projection when the initial state is pure, 



c = tr(II|+o><+o|® n ). 



(26) 



Furthermore, |+)® n is an eigenstate of n,s x and so this ran- 
domness can be completely attributed to the Z stabilizer mea- 
surements, which can be made deterministic by Clifford cor- 
rection. Equations ( 24|25 1 follow directly from properties of 
error correcting codes, but for completeness more details are 
given in App.[B] 

In summary, the trans vers ality of M allows us to consider 
the distillation of magic states \M ) as equivalent to the sim- 
pler problem of distillation in the X basis. Combining these 
results, we have 



E 




M L . (27) 



The output state is diagonal in the basis M L \+^) rather than 
the desired Ml | +^ ) . We reiterate that this cycling is not prob- 
lematic as an even number of iterations always brings us back 



to the initial basis. Decoding onto a single qudit and using £ 
to denote the whole process, we have 



p' cx = c E E "v|Mj)(A/ 



(28) 



j£F d vffijle£ z 



By expanding out a v , we get an iterative formula for f' k 

(M k \p'\M k ), such that 



3 ~ p 



Wtfc(v) 



/,- = 



(29) 



which has been renormalized by dividing through by the suc- 
cess probability P. This probability equals the sum of the 
numerators, which is 



^=E E rU wtfc(v) - 



(30) 



The summation over all j, such that vffij 1 e Cz, is equivalent 
to a sum over all v g sp&n(Cz, (q — 1)1). Using the features 
of CSS codes (see Eq. 15 1 we know span(£^, (d — 1)1) = 



C x and so 



p= E LU wMv) - 



(31) 



Notice that we have dropped a factor of c from the success 
probability, which will be justified later by Clifford correc- 
tion. Both numerator and denominator of /' are polynomials 
of degree n, and can be calculated from the classical codes. 



D. Analyzing the iterative formulae 

Here we consider some properties of the above iterative for- 
mulae. First we consider simple depolarizing noise model and 
give a Taylor series approximation. Next, we consider a com- 
pletely general noise model and show the existence of a dis- 
tillation threshold. 

When the noise is depolarizing, and so fj^o = e/(d — 1) 
and /o = 1 — e, the formula for the fidelity simplifies to 



V££; 



ffl— \v\h f\v\u 

JO Jj^O 



-I v |ff fW\ H 



(32) 



where is again the Hamming weight. The factors /q 1 ap- 
pear on both numerator and denominator and so cancel. Mak- 
ing use of the shorthand 



M = 



fj¥=o 



f (d-l)(l-c)' 
we can further simplify the fidelity formula to 

l^vecz P 



fo = 



|v| H 



(33) 



(34) 



Such cases are easier to study as they depend on only a sin- 
gle parameter and the simple Hamming weights. Indeed, we 
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will show later, in Sec. IV E that this simple form can be fur- 
ther simplified by leveraging some powerful techniques from 
classical coding theory. For now we make some casual obser- 
vations concerning quadratic error suppression. 

Taylor expanding the numerator and denominator to second 
order we have 



Jo 



1 



0(/i 



(35) 



where a (b) is the number of weight d elements of Cz (£~x)- 
Both Cz and C^ contain a single weight zero element, v = 
= (0, 0...0). By definition both contain no other elements 
with weights smaller than d. Further approximating the de- 
nominator and using f = 1 — e' yields 



■a)n u + 0(fi n+L ) 



(36) 



So the suppression of errors is degree D as fi ~ e. In particu- 
lar, since D > 2 the error suppression is at least quadratic. 

The depolarizing noise model is useful for illustrating the 
salient features of a distillation protocol. However, it is im- 
portant to demonstrate error suppression and existence of a 
threshold for all possible noise models. Again we rescale the 
noise parameters to /i^ = fk/fo, and so 



Jo 



EC: 



-1 wtfc(v) 
= 1 



rr d_1 ,, wtf =( v ) 



(37) 



Both Cz and contain v = for which wtfc^o( v ) = 0, 
and so both numerator and denominator contain a term equal 
to 1 . We make a very coarse lower bound on the numerator, 
which must be greater than 1 since all terms are positive. We 
wish to upper bound the denominator less coarsely. First we 
define fj, — max^o{/it} and use it to replace all other noise 
parameters in the denominator, yielding the inequality 



This gives us a valid constant K = dPC as asserted in Thm. 
|2] and Thm. [3] The existence of some distillation threshold 
follows quickly. If we consider e* = K^^ ^ 1 ^ , we find 
that if < e < e* then e' < e. The above analysis is very 
general, but the corresponding bounds will be far from tight 
and a much higher e* will exist. 



E. Clifford correction 

So far we have assumed that the Z stabilizer measurements 
all yield the desired " + 1" outcome. Next we consider the 
process of Clifford correction, as outlined by step 3 of our 
protocol. This additional strategy significantly increases the 
success probability of each round, so much so that success 
is guaranteed in the limit of pure initial states. The gen- 
eral idea is that for any measurement outcomes, with result- 
ing projector Tl' Sz , there exists a Clifford Ca/[w] such that 
Cm [w]IT^ z = ns z Cm [w] . The key fact exploited is that 
for a single qudit Cj\fZ = oj~ 1 ZCm, and so for many qudits 
Cm[w]Z[v] = w-< w - v >Z[v]Cm[w]. To proceed we must 
specify the projector IT^ z . We begin by expressing the lin- 
ear code as Cz = {Gu : u £ F™} where m = Dim(Cz) 
and G is an m by n matrix called the generator matrix of Cz- 
Each column of G gives an individual generator of Cz and 
hence Sz- When the measurement corresponding to the j th 
generator gives outcome u ' , the resulting projection is 



n'« 



i 



Sz 



J2 uj {Ku) Z[Gu}. 



(42) 



Conjugating with a Clifford correction Cm [w] yields 

CmWs z = ^ E ^ <k ' uMw ' Gu> Z[Gu]C[w], (43) 



,|v|h 



(38) 



Recall that v = contributes 1 to the summation and all other 
terms are upper bounded by yP where D is the distance of the 
code. Hence we have 



1^1 



(39) 



1. 



where C is the number of nontrivial terms, C 
Clearly for real x we have 1 > (1 — x 2 ) and so 1 > (1— x)(l + 
x) and positivity of x entails (1 + x)^ 1 > (1 — x). Using this 
with x = Cjj, D gives f > 1 — Cfi D and furthermore 



e' < C[i D < C 



1 - e 



(40) 



where e' = 1 — /q. We assume without loss of generality 
that /o is larger than all other fj, which allows us to bound 

(1 — e) _1 < d and so 



e' < d u Ce 



D 



(41) 



and so the correction works when for all u we have (k, u) = 
(w, Gu) mod d. We can always choose a canonical form for 
the generator matrix, such that G = (ll m |G'), where the iden- 
tity acts on the first m rows of G and G' labels the remain- 
der of the matrix. For such a canonical generator matrix we 
choose w to equal w = (k%, k2-..k m , 0, 0, ...0) so it matches 
the measurement outcomes on the first m entries. This yields 
(w, Gu) = (k, u) and so Clifford correction achieves its goal. 



IV. REED-MULLER CODES 

A. Some concrete examples 

Our demonstration of magic state distillation in higher 
dimensions was conditional on the existence of a M™- 
distillation codes, as specified in Def. |4] Before introducing a 
family of A4™ -distillation codes for all odd prime d, we give 
some concrete examples. We label the codes as QRMd(m) 
where d is again the dimensionality and m dictates the codes 
size and transversality properties. 
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Definition 5 Q1ZA43(2) is a CSS code over n — 8 qudits of 
dimension 3. The Cx code is generated by 

u 1 = (1,2,0,1,2,0,1,2), (44) 
u 2 = (0,0,1,1,1,2,2,2). 

Whereas, Cz is the code generated by 

v x = (1,2,0,1,2,0,1,2), (45) 

v 2 = (0,0,1,1,1,2,2,2), 

v 3 = (0,0,1,2,0,2,1,0), 

v 4 = (1,1,0,1,1,0,1,1), 

v s = (0,0,1,1,1,1,1,1). 

The logical operators are = Z[21] and X^ = X[l]. 

For the above qutrit code, we will find that it is transversal 
with respect to the canonical non-Clifford gate, as in 
Thm.Q] 



M 




(46) 



where r = exp(z27r/9). 

In the introduction of suitable non-Clifford gates, Thm. [T] 
showed that for odd primes greater than 3, it was sufficient 
to set m — 1 to find non-Clifford gates. Remarkably, this 
means that we can find even smaller codes with transversal 
non-Clifford gates. The smallest such code exists for d — 
5, and as we shall see later it performs exceptionally well at 
magic state distillation. 



Definition 6 QRM 5(1) is a CSS code over n 
of dimension 5. The Cx code is generated by 



4 ququints 
(47) 



ui = (1,2,3,4). 

Whereas, Cz is the code generated by 

V! = (1,2,3,4), (48) 
v 2 = (1,4,4,1), 

The logical operators are — Z[A1] and = X[l]. 

For the above code we find it is transversal with respect to the 
canonical VW5 non-Clifford gate, 



M 



/w 3 

' u 

uj- 



V 



(49) 



where here ui = exp(i27r/<i) = exp(i27r/5). Notice how the 
eigenvalues are all multiples of uj. For dimensions smaller 
than d = 5 any diagonal gate with phases that are multiples of 
uj will be a Clifford gate rather than a non-Clifford as desired. 
This property makes it possible in higher dimensions to find 
smaller codes with a transversal non-Clifford. 



From this information one can numerically verify that both 
codes are well defined and have the correct transversality 
properties. Transversality can be verified by calculating the 
effect of the non-Clifford gates on the logical basis states. 
Over the following sections we develop an analytic proof that 
these features are valid for a whole family of quantum codes. 
Further details of the performance of these codes are given 
later, but we hope the examples help guide the reader through 
the general case. 



B. Classical Reed-Muller codes 

Here we review d-ary generalizations of Reed-Muller 
codes [49 -52] and derive the crucial properties we exploit 
later. Convention dictates that we denote Reed-Muller codes 
as lZA4d(u,m), where d tells us the relevant field, u is the 
order of code and m determines the size of the code. Here we 
explicitly use only Reed-Muller codes of 1 st order, so u = 1. 
All Reed-Muller codes are defined by polynomials of a de- 
gree bounded by the codes order. For order 1 Reed-Muller 
codes we must consider degree 1 polynomials, that is linear 
functions. The dual of a Reed-Muller is another Reed-Muller 
code, though it may have a different order [49 -52 1. In this 
way higher order Reed-Muller codes do enter into our work. 
However, it is sufficient for us to define them in terms of du- 
ality. Ultimately, we will not use these codes but their smaller 
shortened versions introduced in the next section. However, 
for pedagogical reasons we first review the unshortened vari- 
ants. 

We begin with a review of linear maps. There are d m linear 
maps from F™ onto F^. All such maps, g a : F™ — > F^, can 
be labeled by vectors themselves, say u 6 F™, and then the 
function will evaluate to <?u(a) = (u,a) = (BjUjdj, again 
modulo d. Next, we consider another mapping, U™ : F™ — > 
FJJ, where n — d m , such that 

UJ*(u) = ((u,a ),(u,a 1 ),...(u,a„_ 1 )), (50) 

where &j is the base d representation of the natural number 
j. For example, with d = 3 and m = 2 we would have the 
ordered set 

{a,} = {(0,0), (0,1), (0,2), (1,0), 

(1,1), (1,2), (2,0), (2,1), (2,2)}. 



Hence for u = (0, 1) we have 

tff [u] = C/ 3 2 [(0, 1)] = (0, 1, 2, 0, 1, 2, 0, 1, 2). 



(51) 



For any d and m (positive integers), the set C = {u = 
[/™(u); u £ F™} is a linear vector space. Closure of the vec- 
tor space under addition follows directly from the closure un- 
der addition of homogenous linear maps. The codes of interest 
are constructed by considering all affine functions, which are 
linear maps plus an additional constant c such that they map 
uto[/™(u)©cl. 

Definition 7 Unshortened Reed-Muller codes, TZM.d(l,Tn), 
are classical linear codes on FJ where n = d m , of dimen- 
sion 771 + 1. They are the set of codewords lZA4d(l, m) = 
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{U™(u) © cl : u € F™, c G F,j} defined in terms of ajfine 
functions. 

Such codes have many exotic properties. Before investigating 
them we introduce one more definition. 

Definition 8 We say a function A : F d — > Z is a X-function 
if there exists a set of d integers {Ao,..Ad_i} such that 

EjGFd X J = and 



A(v) = £)A 0j . 



(52) 



The reader should note that A-functions are closely related to 
the non-Clifford gates introduced in Def. [3] Our main obser- 
vation here is the following. 

Lemma 1 Given a X-function A and an unshortened code 
TLMd{l, m) all v e TZMd{l, m) satisfy A(v) = mod d m . 

To prove the lemma we first consider codewords where u = 0, 
and so v = (c, c, c.c), then 



A(v) = d m X c 



(53) 



which vanishes modulo d m . Let us now consider the code- 
word for the unit vector, u = (1,0, 0..0), and c = 0. The 
corresponding codeword has a repetitive structure as in Eq. 
(51 1, where each element of F d appears d m_1 times. Hence, 



d-l 

A(v) = d'-^A^O, 

3=0 



(54) 



since we required in definition of a A-function that 
X^=o X j = The above argument looks tailored to code- 
words for a unit vector u, but a similar argument holds for 
all codewords with non-trivial u. That is, for any non-trivial 
u there are d" 1 ^ 1 different linear maps that evaluate to each 
possible output. To see this, consider that the family of lin- 
ear maps is invariant under change of variables that preserve 
linearity. Hence, the family of functions can always be ex- 
pressed in a basis such that u is a unit vector. Furthermore, 
these codewords have uniform multiplicity of every value F d , 
and so adding cl will only reorder the elements and not the 
multiplicity with which they appear. This proves our lemma. 

In summary, unshortened Reed-Muller codes have a huge 
amount of symmetry that they inherit from the families of 
affine and linear maps. However, they actually have too much 
symmetry for our purposes. We break just enough of that sym- 
metry by shortening the code. 



C. Shortened classical Reed-Muller codes 

Given a code L over F d , the corresponding shortened code, 
denoted C* , is over F^ _1 . It contains all the codewords of 
C with in the first position and that position deleted. The 



process of shortening is closely related to puncturing, where 
the first position is removed but all codewords are kept. We 
can also give a self contained definition of a shortened Reed- 
Muller code as follows. 

Definition 9 Shortened Reed-Muller codes, TZM.* d (l,m), are 
classical linear codes on F d , where n = d m — 1, of dimension 
to. They are the set of codewords 1ZM. d (l, to) — {P™(u) : 
u € Fd} defined in terms of linear maps. 

Here P™ is the same map as U™, but omitting the first ele- 
ment. For example, the shortened version of Eq. ( |5"T| > is 

P3 2 [u]=Pf[(0,l)] = (l,2,0,l,2,0,l,2), (55) 

which the reader may have noticed is also one of the gener- 
ators of the Cx code for the quantum QR.M^(2) code re- 
viewed earlier. Notice that the self-contained definition of the 
shortened Reed-Muller code makes use of only linear maps 
and not affine maps. In the unshortened code we had a gen- 
erator 1 that corresponded to the constant term in affine func- 
tions. However, when shortening a code we only keep code- 
words with zero in the first position and so the 1 generator is 
dropped. For this reason the dimension of the code drops by 
one; Dim(KM* d (l,m)) = Dim(KM d (l, to)) - 1. Let us 
now consider the shortened analog of Lem. [TJ 

Lemma 2 Given a X-function A and a shortened code 
HM* d {l, to) all v G KM* d (l, to) satisfy A(v © cl) = -A c 
mod d m . 

This follows quickly from Lem.[TJ Given a v G 1ZM d (l, to), 
let us define 



w = (0,vi,v 2 , ..v n ) © cl, 

= (c, V\ ® c, v 2 ® c, ...v n © n), 



(56) 



where clearly w is a codeword of the unshortened code 
lZA4 d (l, to). Furthermore, A(w) = A(v) + A c as it has an 
extra term appended. However, Lem.[TJtells us that A(w) = 
and so A(v) = — A c . We will soon see that Lem [2] is inti- 
mately related to transversality of quantum gates for an asso- 
ciated quantum code. 



D. Quantum Reed-Muller codes 

Here we construct quantum codes from shortened Reed- 
Muller codes for general m and d. 

Definition 10 QlZMdim) with to > 1 is a quantum CSS 
code over n — d m — 1 audits of prime dimension d. The 
codespace is defined by 

1. C x = nM* d {l,m); 

2. C z = [span(£^, l)]- 1 ; 

3. X L = X[l}; 

4. Z L = Z[(d- 1)1]. 
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We could have equivalently specified £z as a higher order 
Reed-Muller code, though the above is simpler. We first check 
that QTZMd(m) codes are indeed quantum codes. By con- 
struction, the stabilizer is Abelian as Cz C C x . It is easy 
to check the logical operators are well defined: that Zl com- 
mutes with the stabilizer; Xl commutes with the stabilizer; 
and X^Zl = w^ 1 Z^X^. Now our next main result can be 
concisely stated. 

Theorem 4 QTZMdi'm) quantum codes are M.™ -distillation 
codes of distance D = 2. 

The main property we need to prove is transversality for all 
M e M'7. As with all CCS codes, we have that 



\Jl) 



1 



A" 



E Mi 1 )- 



(57) 



vECj. 



Acting on this logical state with M®" gives 

M° n \jL) = E ex P fe A ( v ® J' 1 )) l v ® J' 1 )' 

V\ L x\ v€Cx v a 1 

(58) 

where A is a A-function (Recall Def. [8} using the integers 
{Aj} associated with the eigenvalues of the unitary M. Now 
we use our key lemma[2]to conclude 

M® n \j L ) = E exp(-2i 7 rA,-/ C r)|v©jl), 

V\ L x\ ve£x 

= ex p(-2inX j /tr)\j L )=Ml\j L ), 

and so we can identify M® n with 

Proving a distance lower bound is straightforward as dis- 
tance 2 is the smallest non-trivial distance. The relevant dis- 
tance is D z , the smallest |v|jj such that it produces a logi- 
cal error Z[v]II = Z 3 L Il. For such an operator v 6 C x but 
v ^ 0, so the phase error commutes with the X stabilizer 
but is non-trivial. If such an operator existed with Hamming 
weight 1, it would entail that there existed a qudit upon which 
Lx acted trivially, which there is not. That the distance is not 
greater than 2 follows from the following section. 



E. Mac Williams identities 

We have introduced higher dimensional Reed-Muller codes 
and shown that they have suitable transversality properties for 
magic state distllation. Knowing the code stabilizer and us- 
ing Eqs ( 29|34 1 we can calculate the exact analytic formuale 
for arbitrary noise. For QTZAis(2) the general noise problem 
is tractable because Cz and C x are quite small sets, but the 
size and complexity of these sets grows rapidly with d and m. 
This is relevant because the fidelity after 1 iterate is calculated 
by summing over all elements in Cz and C x . By consider- 
ing depolarizing noise, the problem is partially simplified by 



Eq. ( 34 1, which we restate here as 



fo 



(59) 



where Wc (m) k known as a weight enumerator 



(60) 



vEC 



Weight enumerators have been extensively studied in classical 
coding theory fl47l . In particular, a weight enumerator for a 
code C can be related to the weight enumerator for the dual 
code C^- by the Mac Williams identity l47l 

w c ±{n) = <r Dim ( £ >[i + (d- iwwc (A) , 

where we use the shorthand 

1-fi 



l + (d-l)n' 



(61) 



Using Cz = [span(£x, 1)]^ = {^■'x) ± (see Eq. 
Mac Williams identity we have 



13 i and the 



fo = 



dWcAP)' 



(62) 



The codes Cx and C' x are much smaller and simpler than 
their duals, and so the Mac Williams identities has proven ex- 
tremely helpful. Indeed, for Reed-Muller codes we can find a 
closed form for these enumerators. When Cx = IZAidO-, m) 
we have (see App.|C]for details) 



iy £x (/~i) = i + (d m -i)A 



- ^^r l ( dm - d " 



(63) 



and 



W c > x {p) = W Cx (ti (64) 
+ (d- l)[A (dm_1) + (d m - l)fi ( ' dm - 1 - dm ~ 1) }. 

Combining all these formulae and reverting back to the origi- 
nal variables e gives a closed analytic form, which is manage- 
able albeit a bit long for reproducing here. Rather we present 
the Taylor expansion to second order in e 



(d m -l){d-2) 
2(d- 1) ' 



o\ 



(65) 



It is interesting that for all protocols based upon a quantum 
code QlZMdim) we see quadratic error suppression for all 
odd prime d and all m. Whereas the quantum Reed-Muller 
code used by Bravyi and Kitaev, <27\LM2(4), obtained a cubic 
reduction, such that e' ~ 35e 3 . Our analysis also describes 
the Bravyi-Kitaev protocol, the only difference being that in 
the qubit case we need m > 4, and so the above formula also 
holds for qubits. It is intriguing to observe that the factor (d — 
2) appears above and so the quadratic term would vanishes 
only in the qubit case, and so in higher dimensions these Reed- 
Muller codes are only distance 2. This is one of many curious 
differences between qubits and odd prime dimensions. 
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V. PERFORMANCE OF PROTOCOLS 

Here we consider various aspects of the performance of 
our protocols. We begin by showing that our protocols yield 
magic states at a rate that scales only polynomially with the 
desired final error probability. We then use MacWilliams 
identities to analyse thresholds under depolarizing noise mod- 
els for much larger codes. Next we consider in more detail 
the performance of our protocol based on QR.Mj,{2) and 

A. Yields 

The overall performance of a protocol can be captured by 
its yield. Given some target error probability, we calculate the 
yield as the expected fraction of the initial copies that achieves 
the goal. By definition, for any protocol and any distillable 
state p, with error probability e; n , there exists a number of 
rounds N(p, etargct) required to achieve etargot- If on the k th 
round of distillation the success probability is the yield is 
simply 



Y(p, Ctarget) 



n 

k=l...N 



(66) 



where n is the again the number of copies use per iterate. We 
are interested in how this scales as etargot vanishes. The suc- 
cess probability is continuous in e and approaches 1 as e van- 
ishes, thus Pk approaches 1 as k increases. Therefore, for 
all p < 1 these exists a c such that for all k > c we have 
Pk > P c = p. This allows us to lower bound the yield such 
that 



Y(p, etargot) > C 



P 



N-c 



(67) 



where C is a constant overhead, independent of etarget, that 
represents the yield for c iterations. Furthermore, after c iter- 
ations the error probability is now e c . Next we observe that 
for a single round we know e' < Ke D for some K, equiva- 
lently Ke' < (Ke) D . Therefore, the error probability after N 
iterations, ejv> satisfies Ke^ < (Ke c ) . Taking Ke c < 1 
allows us to bound the number of iterations needed such that 



N- 



c < 



log 2 



jgg(ftajggt/jO 

log^/K) 



(68) 



For positive a and b we have the identity a logD ' b ' = b lo&D ^ a \ 
which combined with the above equations entails that 



Y(p, Etargot) > C 



log( e target/^Q 

. Iog(ec7*0 



log D (P c /n) 



(69) 



With the shorthand 7 = — \og D (P c /n), which is positive, we 
have 



Y{p, etargot) > C 



logfergot/^) 7 ' 



(70) 



This decreases by a factor polynomial in e^ gGt . Conversely, 
the expected resource cost of distillation is the inverse yield, 
and this increases only polynomially in e^J; t- The scaling 
is governed by the factor 7 = — \og D (P c /n), but P c can be 
taken arbitrarily close to 1 . As such, the relevant scaling pa- 
rameter is 7* = \og D (n), and so 



Y(p, etargot) ~ 0[log(e t - rRCt /^r 7 *] 



(71) 



For our protocols in odd prime dimension we will find that 

D = 2 and n = d m — 1 so 7* = log 2 (e?™ 1 — 1), which we give 
in table U 



d 


m — 1 


m = 2 


m = 3 


m = 4 


2 


N/A 


N/A 


N/A 


2.46497 


3 


N/A 


3 


4.70044 


6.32193 


5 


2 


4.58496 


6.9542 


9.2854 


7 


2.58496 


5.58496 


8.41785 


11.2288 


11 


3.32193 


6.90689 


10.3772 


13.8376 


13 


3.58496 


7.39232 


11.1007 


14.8017 


17 


4 


8.16993 


12.2621 


16.3498 


19 


4.16993 


8.49185 


12.7436 


16.9917 



TABLE I: The yield scaling parameter, 7*, for distillation by 
QTZMd{ m ) as governed by Eq. 71 The smaller the value of 7*, 



the more resource efficient the protocol in the limit of many itera- 
tions. For qubit systems, the 10-to-2 protocol of Ref. [58| achieves 
7* = log 2 (5) ~ 2.32193, which is the best known value for qubit 
protocols. N/A indicates not applicable, as for those parameters no 
non-Clifford gates exist. 

Notice that the code QHM^l) achieves the best yield 
scaling of all quantum Reed-Muller codes. This accolade 
is retained by <27\LM5(1) even if we compare it with all 
presently known magic state distillation protocols. 



B. Depolarizing noise thresholds 

For some values of d and m, we have used the exact expres- 
sion for e' to find the depolarizing noise threshold e^ op below 
which distillation occurs (see Table |ll|below). This should not 
be confused with the absolute threshold e* that holds for all 
noise models and can be smaller. The threshold gets weaker 
for both increasing d and increasing m, as suggested by the 
above approximate formula for e' (see Eq 65 I. When we in- 
crease m, we increase the number of copies required per it- 
eration but decrease the depolarizing noise threshold. This 



makes it advantageous to use the smallest possible m such 
that M e M.™. The benefit of larger m is rather that a large 
set of states are distilled by the protocol. 

If we also compare our protocols with the threshold of the 
BK protocol for d = 2, the pattern of better threshold for 
smaller dimensions no longer holds. We see that the best 
threshold we observe is for QIZM^Y) with a fairly high 
threshold also observed for QR.J^ij,{2) . There are many sub- 
tle differences in the Clifford group between odd and even 
dimension, and here those differences work in our favor. In 
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d 


m=l 


m = 2 


m — 3 


m = 4 


2 


N/A 


N/A 


N/A 


0.14148 


3 


N/A 


0.211001 


0.0657764 


0.0214564 


5 


0.3631226 


0.0614718 


0.0119213 


0.00236986 


7 


0.2322599 


0.0291865 


0.00409851 


0.000584079 


11 


0.1341066 


0.0111835 


0.00100907 


0.0000916717 


13 


0.1106148 


0.00790156 


0.000604487 


0.0000464795 


17 


0.0818753 


0.00454655 


0.000266565 


0.0000156773 


19 


0.072453 


0.00362063 


0.000190054 


0.0000100014 



TABLE II: The distillation threshold e*[cp f° r depolarizing noise 
when distilled by QlZA4d(m). We include the threshold for the 
Brayvi-Kitaev protocol using 15-qubits, which uses a quantum Reed- 
Muller code Q1ZM2(^)- N/A indicates not applicable, as for those 
parameters no non-Clifford gates exist. 

odd prime dimension we can construct smaller codes with 
transversal non-Clifford gates. Our code Q1ZA4 5(1) uses 4 
ququints covering a Hilbert space of dimension 5 4 , which to 
our knowledge is the smallest non-trivial stabilizer code with 
a transversal non-Clifford gate. Furthermore, research to date 
indicates that smaller codes lend themselves to better thresh- 
olds. A plausible explanation is that larger codes allow more 
undetected errors. Most of these undetected errors will have 
a large Hamming weight, and so while negligible for small 
e, they will be damaging for the modest size e relevant for 
threshold calculations. 

Concerning thresholds for qubit protocols, we have fo- 
cused on the comparable protocol using quantum Reed- 
Muller codes. The qubit threshold can be slightly extended 
by using the 7-qubit Steane code [16] (e* — 0.14645) or 
the 5-qubit code lfT3l on a different class of magic states 
(e* = 0.1719). Though a slight improvement, both fall short 
of our qutrit and ququint thresholds and have much poorer 
yields. 

Before proceeding, we will remark on our notation and ter- 
minology for quantifying depolarizing noise. Throughout we 
have used e = 1 — (Mo|p|Mo) for the error probability. If a 
state suffers depolarizing noise, it has the form 

p = 5\M Q ){M \ + (l-S)t/d, (72) 

and in some parts of the literature S is used to quantify noise. 
Relating these two distinct noise measures we have 

£dcp = (d - l)S/d, (73) 

and so a dependence on the dimensionality appears. In terms 
of S thresholds appear larger, with QRM 3 (2) and QTZM 3 {2) 
having thresholds at 5 = 0.317 and 6 — 0.453 respectively. 
Some readers may find using 5 to be more natural as it may 
be related to the depolarizing noise rate of some unitary used 
to prepare the initial noisy magic states. However, when uni- 
taries suffer depolarizing noise, the best strategy is not simply 
to apply the noisy unitary to |+). Rather better thresholds can 
be achieved with noisy unitaries by using the noise dilution 
protocol of Howard and Vala ll56ll . Furthermore, the threshold 
boosts from noise dilution become more prominent for higher 
dimensions. 




Im(z) 

FIG. 2: The canonical CM-plane for a qutrit, d = 3, which any state 
can be projected onto by Cm -twirling. Every quantum state is a point 
in the complex plane for the complex number z p — tr(CMp)- The 
three pure magic states, \Mk), take values z = 1, w, uj 2 , which have 
2 1 2 = 1 and so lie on a circle in the plane. All physical states have, 
2 = (1 — /1 — /a) + wfi + u 2 f2, and so lie in the convex hull 
of the pure magic states, forming a triangle of physical states. The 
distillable region of states can, by use of the QTZM^ (2) protocol, be 
brought arbitrarily close to nearest pure magic state. The stabilizer 
states are the convex hull over the set of points, 2, taken for each of 
the pure stabilizer states. It is impossible to distil not only the sta- 
bilizer states but also the bound states, as demonstrated in Ref. 1421 . 
Note that the rotational symmetry is to be expected as the Pauli Z 
rotation performs a rotation in the Cm -plane. 



C. Peformance of QTZM 3 (2) 

Here we apply our methods to the 3 dimensional case us- 
ing Q1ZM.3(2), as explicitly defined in Def [5] In previ- 
ous work ||431 we have proposed other protocols for the 3- 
dimensional case, including a generalization of the 5-qubit 
code to qutrits. While magic state distillation was observed 
for this 5-qutrit code, these previous studies only showed a 
linear suppression of noise, whereas here we observe a more 
rapid quadratic suppression with each iteration. 

We take M to be the canonical A4 3 gate, as in Thm. [T]and 
Eq|46] By Cm -twirling all single qudit quantum states are 
projected onto the diagonal in the \Mf.) basis, such that p = 
Y,k fk\M k )(Mk\. When we wish to distil \M ), the weights 
/1 and f% represent different types of noise. A more conve- 
nient parameterization is fi — ecos 2 ((9) and f% = esin 2 (#) 
as we are mainly interested in how the total noise reduces. 
Our techniques allow us to find an analytic solution for e' af- 
ter a single iterate of magic state distillation with QR,M.^{2). 
However, the expression is lengthy so here we truncate to 3rd 
order 

e' = e 2 [3 + cos(40)] + e 3 [9 - cos (40)] + 0[e% (74) 

which is quadratically reduced. In Fig (|3^), we show the exact 
output error probability for the whole range of different noise 
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0.1 0.2 0.3 0.1 0.2 0.3 0.4 

Initial error probability e Initial error probability e 



FIG. 3: The output error, e' against input error, e for (a) Q.TZM S {2) and (b) Q1ZA4 5 (1). For a fixed e there are many different compatible 
states, and so there are many different possible output e' and these are shown as a region rather than single curve. For the worst case noise we 
mark the threshold e* . The dashed line shows the specific instance of depolarizing noise, and the associated depolarizing threshold e*j cp is also 
shown. The straight line is simple the "break even" line. 



models (different 9) and depolarizing noise (9 — tt/2 1 ). We 
find that a threshold of e* = 0.20015 for general noise and 
e dcp = 0.211001 for depolarizing noise (as cited earlier) . As 
such, for all 9, if < e < e* it follows that e' < e. We can 
also find a quadratic upper bound, such that for all e and 9 we 
have e' < Ke 2 with K = 5.03. The value of K is found by 
considering the function e'e~ 2 and numerically maximizing, 
so K = sup e g{e't~ 2 }- 

The region of distillable states is actually slightly larger 
than the e < e* region, with a greater noise tolerance for some 
values of 9. To find the whole distillable region we resort to 
numerics and present the results as part of Fig. [2] Several other 
important regions of the plane are also highlighted. We show 
the stabilizer states and bound magic states, which cannot be 
distilled by any stabilizer operation. Between these regions is 
a non-empty regime of ambiguous status, which neither our 
protocol works upon nor is ruled out from distillability by any 
known theorem. Even in the simple qubit case, such puzzling 
regimes exist and it has proven challenging to conclusively 
decide their status, see for example Refs. G5ll26l . 

Also important is the success probability of distillation with 
Q7vLM 3 (2), which for all states satisfies P > 1/9 and for 
small e is approximately 



P = 1 - 8e + [31 + cos(46>)]e 2 + 0(e 3 ). 



(75) 



Given these fairly high success probabilities and that we use 
only 8 copies per iteration, this protocol is competitive in 
comparison to the Bravyi-Kitaev protocol (herein BK) that 
also used Reed-Muller codes. Their protocol uses 15 copies 
per iteration and has P > 1/16 and for small e it achieves 
P = 1 - 15e + 0(e 2 ). Our QHM 3 (2) code requires fewer 
copies per iteration, but it would require more iterations to 
achieve the same error suppression as BK, since BK has a cu- 
bic error suppression rather than just quadratic. 

In Figs. (|4j laj4] lb) we consider the exact yield of our pro- 
tocol, compared against BK, assuming depolarizing noise, so 
9 = 7r/4. For small error probability e in < 0.05, the yield of 
our protocol Q7ZA43(2) is similar to BK. Both protocols give 



yields of the same order of magnitude and which protocol is 
superior fluctuates with variation in required iterations. How- 
ever, as the initial error probability e; n increases, the yield of 
QR.M.3,{2) exceeds that of BKby many orders of magnitude. 
The dominant effect here is that the yield of BK vanishes as 
we approach the threshold e* BK ~ 0.1415, whereas our proto- 
col can tolerate depolarization all the way upto e 



dcp 



0.211. 



The results of Sec V A also give us analytic tools for esti- 
mating yields. These show that for small etargot the yield of 
our protocol decreases as 



Y ~ O[log(e t -4et/5-03) 



(76) 



where 7* = log 2 (8) = 3. This can be compared with the 
BK protocol, which achieves a similar scaling with 7* = 
log 3 (15) ~ 2.46. We see these protocols have similar scaling 
properties, but BK performs slightly better in the large t 
limit. However, the numerical results reported in the previous 
paragraph show that finite size effects and a superior threshold 
often outweight these asymptotic arguments. 



D. Peformance of QKM 5 (1) 

Next we apply our methods to the 5-dimensional case using 
the code Q1ZM§(1), as explicitly defined in Def[6] This is the 
first protocol ever applied to the problem of distilling magic 
states in 5-dimensional systems. The code and associated pro- 
tocol have many distinguishing feature already mentioned: it 
is the smallest known non-trivial code to have a transversal 
non-Clifford; it has the largest noise threshold against depo- 
larizing noise (e^ cp = 0.363); and it has the best known scal- 
ing in terms of expected yield (with 7 = 2). All these fea- 
tures can be attributes to the fact that d = 5 is the smallest 
dimension where diagonal non-Clifford gate exist with period 
d, allowing us to work with m = 1. 

Again we take M to be the canonical M. 5 gate, as in Thm.[T] 
and Eq. (49 1. The C^-twirled states are parameterised by a 
fidelity, fo = 1 — e, and 4 independent noise parameters fj 
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for j — 1, 2, 3, 4. In Fig. (J3]?) we show the range of different 
output error rates for all different types of noise and the de- 
polarizing noise, which have thresholds of e* = 0.31195 and 
e* dcp = 0.363122. We noted earlier that the QKM 5 (i) pos- 
sesses the best known protection against depolarizing noise, 
but also see here that its robustness against generic noise is 
also unrivaled. 

Unfortunately, 5-dimensional systems are quite complex. 
Even after twirling into the Cm -plane, we cannot easily vi- 
sually represent the whole distillability region as with did for 
the qutrit protocol. For this reason we focus on the depolar- 
ized case with f = 1 — e and fj^ = e/4. After a success- 
ful implementation of one round, a depolarized state is output 
with 

, _ e 2 (96- 160e + 75e 2 ) 

64 - 256e + 480e 2 - 400e 3 + 125e 4 ' 
3f 2 7f 3 

~ -Y + ~Y + 0[£i] ' (77) 

and this occurs with probability 

_ (1 - 2e) 4 (64 - 256e + 480e 2 - 400e 3 + 125e 4 ) 
P ~ ~ 64(-l + e) 4 

~ l-8e+^ + 0[£ 3 ]. (78) 

Based on these results we expect the protocol to have an ex- 
cellent yield. We numerically studied the yield and again 
compared it against the qubit protocol QR.M.2{^) or Bravyi 
and Kitaev, see Figs. (|4]2aj4]2b). The numerics confirm that 
across all parameter regimes QR,M.^{1) offers a significant 
resource savngs of potentially many orders of magnitude. 
Magic state distillation is typically the most resource intensive 
aspect of fault tolerance schemes, and so high yield protocols 
are very desirable. 



VI. STATE-INJECTION AND UNIVERSAL QUANTUM 
COMPUTING 

Protocols for qudit magic state distillation are our main fo- 
cus, but what happens after preparation of a highly purified 
magic state? Our ultimate goal is to simulate a non-Clifford 
group unitary via state-injection. For the Cm -magic states of 
direct interest we show the following. 

Theorem 5 Consider any M £ M.™ and any noisy magic 
state a with e = 1— (Mo\a\Mo). There exists a trace preserv- 
ing stabilizer operation, Q, that deterministically implements 
state-injection such that for all p 

\\g(a®p) -MpM f ||i < 2e. (79) 

where ||..||i is the trace norm \\A\\x = tr(VAAt'). 

For perfect magic states, e = 0, this entails that Q (pm ® p) = 
M pM* . We first focus on the ideal case and later extend to 
noisy magic states. 



For qubit systems, any magic state on the equator of the 
Bloch sphere may be exchanged for a unitary randomly se- 
lected from a pair of non-Clifford phase gates 0131 . In pre- 
vious work, it was shown that a qutrit analog of the Bloch 
sphere equator [43 1 provides magic states that can be used for 
state-injection of non-Clifford phase gates. Here we review 
and generalize these ideas. 

Definition 11 We say a qudit quantum state |0) is equatorial, 
or a phase state, if&£ R d and 

|0) = ^E^b'). (80) 

It follows immediately that a \Mq) state is a phase state with 
Qj = 2\jir/d m . The essential feature of such states is that 
they are unbiased with respect to the computational basis, 
such that a Z measurement will generate completely random 
outcomes. Taking an unknown state \ip) and measuring ZZ^ 
on the pair | ©) | -0) will also give unbiased outcomes, and so 
no information is gained from Denoting a general state 
as \ip) = J2j c j |i)> tne result of a projection, life, onto a sub- 
space stabilized by uj^ k ZZ 1f yields 

n k \ip)\e) (xJ2 c ^ i&jmk \j® k )\j)- (8i) 

3 

We decode by performing a Clifford unitary such that \j 
k)\j) — > \k)\j) and tracing out the first system. As promised 
the result is a unitary transform, \ip) — > Uk(Q)\ip), where 

U k (Q) = Y^e ie '°*\j)(j\, (82) 

3 

which can also be expressed as 

U k (Q) = (X k )iU (e)X k , (83) 

where we note 

u (e)\+ ) = \e). (84) 

The transformation is unitary, but randomly selected from d 
different possibilities. 

How do we simulate a deterministic unitary required for 
a computation? Herein we consider unitary gates produced 
from a magic state, |Mo), such that 

U k = {X k )^MX k . (85) 

Again we exploit the relationship between M and the Clifford 
unitary C M = MXMl Noting that C k M = MX k M^ we 
express the unitary as 

U k = {X k )HlX k M^M, (86) 
= (X k yC k M M. (87) 

Therefore, we can recover the desired M unitary by applying 
the inverse of Clifford unitary [X yC^. 
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FIG. 4: The yield on a log-scale of of our protocols, Q1ZA4 L i(2) and Q1ZA4 5 (1), (blue) compared with the Bravyi-Kitaev Q7iLM2(4) (red) 
protocol. Plots (la) and (2a) are a function of initial error probabilities and target error probabilities e t argct- For the qutrit and ququint 
states the noise is depolarizing. Both (la) and (2a) come with a cross-sections (lb) and (2b) respectively, with the target error probability held 
constant. The sudden changes in yield occurs because of discrete changes in the number of iterations required. 



We have established a deterministic stabilizer operation, 
such that g(\M )(M \ ® p) = MpM^. We now relax our 
assumptions and allow the resource to be imperfect, so that 
1 — e = (Mo|cr|Mo). By CM-twirling we can ensure the state 
has the form a = (1 - e)\M ){M \ + «r' where \\o'\\\ = 1. 
Applying our map Q to the noisy state gives 

G(a ® p) = (1 - e)MpM t + eG{a' ® p), (88) 

Subtracting MpM' yields 

G(a ® p) - MpAft = eg(a' ® p) - eMpM^. (89) 

Taking the trace norm and using the triangle inequality gives 

\\g(a®p)-MpM% < e\\g{a f 9p)\\i + e\\MpM i \\i, 
= 2e. 

This is a rigorous treatment of the intuition that if the magic 
state is almost perfect then so too is the state-injection. 

The addition of non-Clifford M and M' gates to our reper- 
toire of unitaries generates a set dense in the special unitary 
group (see Refs. |[53ll59l and App. ID}, Furthermore, for every 
gate in this set its inverse is also contained in the set. Thus the 
Solovay-Kitaev algorithm can be applied to ensure an efficient 
approximation of any unitary. This argument also applies to 
the results of Ref J43), where the qutrit Clifford group was 
supplemented by a non-Clifford unitary but universality was 
only conjectured there. 



VII. DISCUSSION 

We have generalized the idea of magic state distillation us- 
ing quantum Reed-Muller codes to all prime dimensions, en- 
abling preparation of highly purified pure non-stabilizer states 
given a device capable of ideal stabilizer operations. By state- 
injection these magic states enable us to simulate universal 
quantum computation. While many aspects of the generaliza- 
tion were very analogous to the qubit case, there have also 
been some remarkable surprises. In odd prime dimension the 
non-Clifford gates we gain are fundamentally different from 
the phase gates implemented by the Bravyi-Kitaev protocols. 
In particular, we find that for primes d > 5 there exist quan- 
tum Reed-Muller codes of only d — 1 qudits that possess these 
non-Clifford gates as transversal gates, whereas 2 4 — 1 = 15 
qubits are needed for a similar construction. 

To our knowledge the ququint code {d — 5) using only 
4 ququints is the smallest non-trivial stabilizer code with a 
transversal non-Clifford gate. This translates into real prac- 
tical gains, with the ququint protocol achieving better error 



probability thresholds (see Sec. V B i than any other known 
protocol with a polynomially scaling yield; e^ cp = 0.363 for 
depolarizing noise. Calculating the yield of the ququint proto- 
col also shows that it is superior to all known qubit protocols, 
as demonstrated both by numerics and analytic scaling argu- 
ments. For larger prime dimensions, d > 5, the thresholds 
and resource costs deteriorate with increasing dimension. It is 
not presently clear whether this is an inevitable problem with 
higher dimensional systems or a peculiarity of our protocols. 
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We also investigate in detail the performance of an 8 qutrit 
(d = 3) protocol, which whilst not as effective as the ququint 
protocol was still competitive against qubit protocols. 

It is natural to question whether e, as defined in equation 
is a fair measure to compare noise thresholds in systems 
of different dimensions. It would be desirable to use a noise 
measure which is practically motivated based on a noise pro- 
cesses which could occur in the lab. In BT l. the depolaris- 
ing noise rate S is employed, where 5 measures the degree 
of depolarising noise of state p from pure state state via 
p = (1 — 5)\i/j)(ip\ + (p/d)i. For a depolarising noise model 
5 is related to e via e = ((d — l)/d)5. Quantifying error via S 
penalises higher dimensional states, yet even via this measure 
the thresholds for the 4-ququint code continue to significantly 
outperform their qubit counterparts. 

Nevertheless, one can argue that S is also an unfair method 
of comparison given the larger number of noise processes 
which contribute to depolarising noise for higher dimensional 
systems. Ultimately, for the context of magic state distillation, 
the most relevant measure of comparison would be the yield at 
the fault-tolerance threshold. Unfortunately, at present, fault 
tolerant quantum computation with higher dimensional sys- 
tems remains a little explored research area, and comparable 
thresholds to e.g. Knill Q or Harrington et al 0's schemes 
are unknown. We know of only one study of qudit fault 
tolerance thresholds [60|, and while evidence was presented 
that higher dimensional systems may provide better thresholds 
than their binary counterparts, the analysis in this paper is lim- 
ited. In particular therefore, our results motivate further study 
of an full fault-tolerance schemes based on ququint and qutrit 
components. It is possible that the enhanced performance in 
dimensions 3 and 5 seen in our magic state distillation proto- 
cols translate into better thresholds and resource costs for full 
fault tolerance schemes based on qutrits and ququints. 

Another application of our results it to models of computa- 
tion where the fault tolerant operations are a proper subgroup 
of the Clifford group. For instance, the qubit topological clus- 
ter states J3] [TTl cannot directly prepare Y eigenstates, but 
they can be distilled using magic state distillation. In qudit 
generalizations of the topological cluster scheme, we antic- 
pate that preparation of XZ eigenstates will not be topologi- 
cally protected. While we have focused on distillation of non- 
stabilizer states, our protocols also enable distillation of XZ 
stabilizer states. 

Our understanding of the magic state model is still in its 
infancy despite many striking similarities with the more ma- 
ture theory of entanglement. However, as reviewed in the in- 
troduction there has been a flurry of recent results on qudit 
magic state model. Numerous problems of a fundamental na- 
ture now present themselves as ripe for tackling. Inspired by 
entanglement theory, we might ask if their exist qudit proto- 
cols for magic catalysis lETTl l62l or magic activation l6Tl 1631 . 
Furthermore, while all known protocols offer yields of magic 
states with arbitrarily small error probabilities, the yield van- 
ishes as the target error vanishes. Contrast this with entangle- 
ment theory where the hashing protocol [64 65 1 and quantum 
polar-coding techniques 1 66 1 offer a method of distilling en- 
tanglement at a non-zero yield even for vanishing target error. 



Whether such a protocol could exist for magic state distilla- 
tion is an intriguing and wide open question. 

In the final stages of this research we became aware of re- 
cent work that proposes a novel protocol |58 1 for qubit magic 
state distillation. The protocol, which they call the 10-to-l 
protocol, takes 10 noisy magic states each iterate and outputs 
2 magic states. This is the first protocol to output more than 
1 magic state per iterate, and this has the benefit of increasing 
its yield. Potentially similar techniques could also be used to 
design higher dimensional protocols. 



VIII. ACKNOWLEDGEMENTS 

We acknowledge the financial support of the EPSRC, 
the Leverhulme Trust, the BMBF (QuORep) and the EU 
(QESSENCE). DEB was supported in part by the Perimeter 
Institute, and would like to thank them for their hospitality. 
We would like to thank Martin Kliesch, Daniel Gottesman, 
Peter Shor, Joe Fitzsimons, Debbie Leung and Mark Howard 
for helpful discussions. In particular, we would like to thank 
Vadym Kliuchnikov for bringing ll53l to our attention and for 
suggesting the argument given in Appendix [D] 

Appendix A: The canonical M gate 

Here we verify the assertions of Thm. [T]and show that the 
canonical Al is a member of A4™ for the asserted values of d 
and m. We begin by showing that 

C M = MXM^ oc XP. (Al) 

Left multiplying by X^ gives XHlXM^ oc P. The left hand 
side is then 

XHIXM^ = exp(*27r(A iffl i - \ 3 )/d m )\j)(j\. (A2) 

j 

This equals P, upto a global phase, if for all < j < q — 1, 

jrn— 1 / J 



(A3) 



for some c. We first solve for the cases where j 1 = j + 1, 
that is j 7^ q — 1. For this set of equations, we may use 
standard arithmetic and recurrence equation methods, and the 
general solution is 



A, 



i-i / J 



+ jc + Aq, 



(A4) 



for all j, where c and Ao are integers to be determined. These 
integer variables will be fixed by demanding that Eq. ( A3 1 
with j = d — 1 holds, and also that Aj = 0. First, let 
us impose the former condition and substitute Eq. ( |A4] > into 
Eq. ( |A3) l for j = d - 1, to yield 

Ao - \ d -i = Ao - (d™- 1 r ~ ^ + j(d - l)c + A ) , 



= d ri 



-i/O'-i 



17 



Solving this equation for c yields 



(A5) 



For m > 2, inspection reveals that c is integer-valued for all 
d. For m = 1, c is integer-valued for all prime d > 5. This 
follows from the fact that when to = 1, c = — (d— l)(d— 2)/6. 
We use the fact that 6 = 3x2. Since d > 5 is a prime number 
not equal to three d it is not divisible by three, thus either 
(d — 1) or [d — 2) must be divisible by three. Since d > 5 is a 
prime number not equal to two then (d — 1) must be divisible 
by 2. Hence the product (d — l)(d — 2) is divisible by 6 for 
all primes d > 5, and c is an integer for to = 1 and d > 5. 

It remains to fix Ao by imposing that . Aj = 0. Perform- 
ing the summation and simplifying, we find that 



A 



d+1 
4 



(A6) 



Again, for to > 2 this is (by inspection) integer-valued for 
all d. For to = 1, this is integer-valued for all prime d > 5, 
and the proof for this latter case is similar to above. When 
to = 1, A = (d + l)(d - l)(d - 2)/24. We observe that 
24 = 3 x 2 x 4. Since d > 5 is a prime number not equal to 
three d it is not divisible by three, thus either (d— 1) or (d— 2) 
must be divisible by three. Since d is an odd prime number 
both (d + 1) and (d — 1) must be divisible by two, and one of 
this pair must be divisible by 4. Hence (d + l)(d — l)(d — 2) 
must be divisible by 24 and consequently Ao is an integer for 
to = 1 and d > 5. 

Thus, the gate M as defined in theorem 1, satisfies all the 
requirements to be a member of Ai™. For to = 1 and d = 3, 
Xj, is not integer- valued for all values of j and so the above 
argument does not provide a member of Indeed, for d = 
3 it is easy to numerically search the sets of gates with integer 
Xj and verify that none are non-Clifford and so M\ is empty. 



Appendix B: Projection onto logical subspace 



( 24]25 



Here we present the reasoning that leads to Eqs 
which can be divided up into 3 cases: a detected error, no 
error and an undetected error. 

When v ^ C x , an error is present that is detected by 
the code and so the state vanishes, n|+ v ) = 0. To see 
this we recall that X\+k) — oj k \+k) and so more gener- 
ally X[u]|+ V ) = w< v>u> |+ v ). Projecting onto the "+1" 
eigenspace of all X[u] G Sx entails that the state will vanish 
unless (v, u) — for all u £ Cx- This is simply the require- 



ment that v is in the dual of Cx, which proves Eq. (24 1. 

For the "no error" instances, v £ Cz, the state does 
not vanish under projection. Furthermore, since |+ v ) = 
Z[v]\+)® n and UZ[v] = II we have n|+ v ) = and 
so all such states must be projected onto the same logical state. 
Finally, we observe that |+}® n is stabilized by X L = X® n 
andson|+)®" = y/c\+%). 

All other possibilities correspond to undetected errors, re- 
sulting in a projection onto other logical states. In such 



cases, v e C x and so there must exist a j £ such 
that w = v ffi jl € Cz- In terms of Pauli operators, we 
have Z[w] = Z[w]Z[jl] and so Z[v] = Z[w]Z[(d - j)l]. 
Since the logical operator is Zl = Z[(d — 1)1] it follows 
that Z[v] = Z[w]Z J L . In terms of the quantum state, we 
have |+ v ) = Z[w}Z 3 L \+)® n and so after projection n|+ v ) = 



-f) 



Appendix C: Weight enumerators 



Here we find the weight enumerators for the short- 
ened Reed-Muller codes Cx = ~R.M* d (l, m) and C' x — 
span(£x,l) as given in Eqs. (63 64 1. Since Cx C C' x it 
is natural to start with Cx and then add the remaining terms. 

First, Cx contains a zero vector (0,0,..., 0) with zero 
Hamming weight. Second, all the remaining codewords, there 
are d m — 1 such codewords, have (d— 1) zeros, i.e have Ham- 
ming weight n — (d — 1) = d m — d. Thus we have the weight 
enumerator 



W Cx (x) = l + (d m -l)a; 



{d m -d) 



(CI) 



The enumerator for C' x can be broken up into d separate sums, 
since C' x = {C x ,Cx © © (d- 1)1}, and so 

d-l 



3=0 



d-l 



i=i 

For the rest of this argument we focus on the j '• j£ terms. 
First, each jl when added to the (0, 0, ... , 0) vector will gen- 
erate a codeword of full Hamming weight ( n = d m — 1). Sec- 
ond, each j 1 when added to any other codeword of Cx (other 
than the (0,0,..., 0) vector) results in a codeword with d m_1 
zero's and so Hamming weight n — d m_1 = d m — 1 — d m_1 . 
For each Cx © jl, there are d rn — 1 such codewords and so 

W Cx9jl {x) = £ ( < f "- 1) + (d m - l)x<« jm - 1 - <r " I >. (C2) 

For every j ^ we get the same result and we have d — 1 
such sums, and so 

Wa x {x) = W Lx (x) + (d-l)Wc xB1 (x), 

which expands out into the formula given in the main text with 
x = fi. 



Appendix D: Adding any non-Clifford gate promotes the 
Clifford group to a universal set 

We consider quantum circuits on n qudits of odd prime di- 
mension d. We will show here that the combination of two 
theorems of Nebe, Rains and Sloane shows that the addition of 
any non-Clifford gate to the Clifford group generates a set of 
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unitaries, that is dense in SU(cP). In Ref. Il59l . Theorem 7.3 
implies that any finite group that contains the Clifford group 
must be generated by the Clifford group and a gate propor- 
tional to the identity. Thus the group H generated by the Clif- 
ford group and a non-Clifford unitary (not proportional to the 
identity) cannot be finite and must be of infinite order. 

In Corollary 6.8.2 of (53), it is shown that any closed sub- 
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